智能论文笔记

Traffic Congestion Prediction using Deep Convolutional Neural Networks: A Color-coding Approach

Mirza Fuad Adnan , Nadim Ahmed , Imrez Ishraque , Md. Sifath Al Amin , Md. Sumit Hasan

分类：计算机视觉 | 人工智能

2022-09-16

由于计算机视觉的最新进展，流量视频数据已成为限制交通拥堵状况的关键因素。这项工作为使用颜色编码方案提供了一种独特的技术，用于在深度卷积神经网络中训练流量数据之前。首先，将视频数据转换为图像数据集。然后，使用您只看一次算法进行车辆检测。已经采用了颜色编码的方案将图像数据集转换为二进制图像数据集。这些二进制图像被馈送到深度卷积神经网络中。使用UCSD数据集，我们获得了98.2％的分类精度。

translated by 谷歌翻译

Syntax-Guided Program Reduction for Understanding Neural Code Intelligence Models

Md Rafiqul Islam Rabin , Aftab Hussain , Mohammad Amin Alipour

分类：机器学习

2022-05-28

神经代码智能（CI）模型是不透明的黑盒，几乎没有关于他们在预测中使用的功能的见解。这种不透明度可能会导致他们的预测不信任，并阻碍其在安全至关重要的应用中的广泛采用。最近，已经提出了输入程序减少技术来识别输入程序中的关键功能，以提高CI模型的透明度。但是，这种方法是语法 - 乌纳威，不考虑编程语言的语法。在本文中，我们采用了语法引导的减少技术，该技术在减少过程中考虑了输入程序的语法。我们对不同类型输入程序的多个模型进行的实验表明，语法引导的减少技术更快，并且在简化程序中提供了较小的关键令牌集。我们还表明，关键令牌可用于生成对抗性示例，最多可用于65％的输入程序。

translated by 谷歌翻译

GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection

Sowmen Das , Md. Saiful Islam , Md. Ruhul Amin

分类：计算机视觉 | 机器学习

2021-12-08

法医分析取决于从操纵图像识别隐藏迹线。由于它们无法处理功能衰减和依赖主导空间特征，传统的神经网络失败。在这项工作中，我们提出了一种新颖的门控语言注意力网络（GCA-NET），用于全球背景学习的非本地关注块。另外，我们利用所通用的注意机制结合密集的解码器网络，以引导在解码阶段期间的相关特征的流动，允许精确定位。所提出的注意力框架允许网络通过过滤粗糙度来专注于相关区域。此外，通过利用多尺度特征融合和有效的学习策略，GCA-Net可以更好地处理操纵区域的比例变化。我们表明，我们的方法在多个基准数据集中平均优于最先进的网络，平均为4.2％-5.4％AUC。最后，我们还开展了广泛的消融实验，以展示该方法对图像取证的鲁棒性。

translated by 谷歌翻译

HS-BAN: A Benchmark Dataset of Social Media Comments for Hate Speech Detection in Bangla

Nauros Romim , Mosahed Ahmed , Md Saiful Islam , Arnab Sen Sharma , Hriteshwar Talukder , Mohammad Ruhul Amin

分类：自然语言处理

2021-12-03

在本文中，我们展示了HS-BAN，Bangla语言的二进制类仇恨语音（HS）数据集组成，包括超过50,000名标签评论，其中包括40.17％的仇恨和休息是非仇恨的。在准备DataSet时，遵循严格和详细的注释指南，以减少人类注释偏见。 HS DataSet还预处理了语言上，以提取不同类型的俚语，目前人们使用符号，首字母缩略词或替代拼写来编写。这些俚语被进一步分为传统和非传统俚语列表，并包含在本文的结果中。我们探讨了传统的语言特征和基于神经网络的方法，为Bangla语言开发仇恨语音检测的基准系统。我们的实验结果表明，现有的单词嵌入模型培训的型号训练，而不是用正式文本接受培训的模型。我们的基准测试显示，FastText非正式单词嵌入顶部的BI-LSTM模型实现了86.78％F1分数。我们将使数据集提供可供公共使用。

translated by 谷歌翻译

Encoding Program as Image: Evaluating Visual Representation of Source Code

Md Rafiqul Islam Rabin , Mohammad Amin Alipour

分类：机器学习

2021-11-01

在神经模型的输入向量中编码源代码有几种方法。这些方法尝试在其编码中包含输入程序的各种句法和语义特征。在本文中，我们调查Code2Snapshot，这是基于输入程序的快照的源代码的新颖表示。我们评估此表示的几种变体，并将其与利用输入程序的丰富的句法和语义特征的最先进的表示性能进行比较。我们对代码2SNAPSHOT在代码摘要任务中的实用性的初步研究表明，输入程序的简单快照对最先进的表示具有可比性。有趣的是，模糊的输入程序对Code2sNapshot性能的影响微不足道，这表明，对于一些任务，神经模型可以通过仅仅依赖于输入程序的结构来提供高性能。

translated by 谷歌翻译

Memorization and Generalization in Neural Code Intelligence Models

Md Rafiqul Islam Rabin , Aftab Hussain , Mohammad Amin Alipour , Vincent J. Hellendoorn

分类：机器学习

2021-06-16

深层神经网络（DNN）越来越多地用于软件工程和代码智能任务。这些是强大的工具，能够通过数百万参数从大型数据集中学习高度概括的模式。同时，它们的大容量可以使他们容易记住数据点。最近的工作表明，当训练数据集嘈杂，涉及许多模棱两可或可疑的样本时，记忆风险特别强烈表现出来，而记忆是唯一的追索权。本文的目的是评估和比较神经代码智能模型中的记忆和概括程度。它旨在提供有关记忆如何影响神经模型在代码智能系统中的学习行为的见解。为了观察模型中的记忆程度，我们为原始训练数据集增加了随机噪声，并使用各种指标来量化噪声对训练和测试各个方面的影响。我们根据Java，Python和Ruby Codebase评估了几种最先进的神经代码智能模型和基准。我们的结果突出了重要的风险：数百万可训练的参数允许神经网络记住任何包括嘈杂数据，并提供错误的概括感。我们观察到所有模型都表现出某些形式的记忆。在大多数代码智能任务中，这可能会很麻烦，因为它们依赖于相当容易发生噪声和重复性数据源，例如GitHub的代码。据我们所知，我们提供了第一个研究，以量化软件工程和代码智能系统领域的记忆效应。这项工作提高了人们的意识，并为训练神经模型的重要问题提供了新的见解，这些问题通常被软件工程研究人员忽略。

translated by 谷歌翻译

Using BERT Embeddings to Model Word Importance in Conversational Transcripts for Deaf and Hard of Hearing Users

Akhter Al Amin , Saad Hassan , Cecilia O. Alm , Matt Huenerfauth

分类：自然语言处理 | 人工智能

2022-06-24

聋哑人在看直播电视时经常依靠字幕来聋。实时电视字幕通过使用各种标题评估指标的监管机构评估。但是，字幕评估指标通常不会由DHH用户的偏好或字幕有多有意义。有必要构建字幕评估指标，以考虑成绩单中单词的相对重要性。我们在现有语料库中的两种类型的单词嵌入和人类宣传的单词形象分数之间进行了相关分析。我们发现，使用BERT生成的归一化情境化嵌入与基于Word2VEC的单词嵌入更好的与手动注释的重要性分数更好的相关性。我们提供了单词嵌入及其人类宣布的重要性分数的配对。我们还通过训练单词重要性模型来提供概念验证效用，在6级单词重要性分类任务中达到0.57的F1得分。

translated by 谷歌翻译

A Disability Lens towards Biases in GPT-3 Generated Open-Ended Languages

Akhter Al Amin , Kazi Sinthia Kabir

分类：自然语言处理 | 人工智能 | 机器学习

2022-06-23

语言模型（LM）在全球许多基于语言的应用空间中变得普遍。尽管这些LMS正在改善我们与数字产品的日常互动，但无论是开放式语言还是由这些模型生成的文本仍然揭示了对特定人群的任何偏见，因此仍然存在担忧，从而冒着某种产品的可用性风险。有必要确定这些模型是否具有偏见以改善这些模型的公平性。这一差距激发了我们正在进行的工作，在该工作中，我们通过残疾镜头测量了GPT-3生成的文本的两个方面。

translated by 谷歌翻译

Exploring the Use of Data-Driven Approaches for Anomaly Detection in the Internet of Things (IoT) Environment

Eleonora Achiluzzi , Menglu Li , Md Fahd Al Georgy , Rasha Kashef

分类：机器学习

2022-12-31

The Internet of Things (IoT) is a system that connects physical computing devices, sensors, software, and other technologies. Data can be collected, transferred, and exchanged with other devices over the network without requiring human interactions. One challenge the development of IoT faces is the existence of anomaly data in the network. Therefore, research on anomaly detection in the IoT environment has become popular and necessary in recent years. This survey provides an overview to understand the current progress of the different anomaly detection algorithms and how they can be applied in the context of the Internet of Things. In this survey, we categorize the widely used anomaly detection machine learning and deep learning techniques in IoT into three types: clustering-based, classification-based, and deep learning based. For each category, we introduce some state-of-the-art anomaly detection methods and evaluate the advantages and limitations of each technique.

translated by 谷歌翻译

A Comparison Study of Deep CNN Architecture in Detecting of Pneumonia

Al Mohidur Rahman Porag , Md. Mahedi Hasan , Dr. Md Taimur Ahad

分类：计算机视觉 | 机器学习

2022-12-30

Pneumonia, a respiratory infection brought on by bacteria or viruses, affects a large number of people, especially in developing and impoverished countries where high levels of pollution, unclean living conditions, and overcrowding are frequently observed, along with insufficient medical infrastructure. Pleural effusion, a condition in which fluids fill the lung and complicate breathing, is brought on by pneumonia. Early detection of pneumonia is essential for ensuring curative care and boosting survival rates. The approach most usually used to diagnose pneumonia is chest X-ray imaging. The purpose of this work is to develop a method for the automatic diagnosis of bacterial and viral pneumonia in digital x-ray pictures. This article first presents the authors' technique, and then gives a comprehensive report on recent developments in the field of reliable diagnosis of pneumonia. In this study, here tuned a state-of-the-art deep convolutional neural network to classify plant diseases based on images and tested its performance. Deep learning architecture is compared empirically. VGG19, ResNet with 152v2, Resnext101, Seresnet152, Mobilenettv2, and DenseNet with 201 layers are among the architectures tested. Experiment data consists of two groups, sick and healthy X-ray pictures. To take appropriate action against plant diseases as soon as possible, rapid disease identification models are preferred. DenseNet201 has shown no overfitting or performance degradation in our experiments, and its accuracy tends to increase as the number of epochs increases. Further, DenseNet201 achieves state-of-the-art performance with a significantly a smaller number of parameters and within a reasonable computing time. This architecture outperforms the competition in terms of testing accuracy, scoring 95%. Each architecture was trained using Keras, using Theano as the backend.

translated by 谷歌翻译